
Foundation Models
Currently invovled in researching one-shot and zero-shot learning for foundation models to add controlability on the output of these models.
Exploring the following conditioning information:
AI Research Engineer
Currently invovled in researching one-shot and zero-shot learning for foundation models to add controlability on the output of these models.
Exploring the following conditioning information:
ARRG is the task of generating radiologist like report for x-ray images. In this research I am exploring the usability of prior informaiton such as prior x-ray images and meta information for enhanced automated radiology report generation.
TATA (TisepX Advanced Translation and Analysis) API is used to deploy trained machine learning and deep learning model into windows/linux client applications.
I employed data structures best practices for storing and accessing data as well as proposed novel UX design by accompnaying alpha channel with the segmentation mask for better visualization of the results.
I also contributed in deploying of AI model on cloud using Amazon EC2 and integrating it with the backend Fast API.
In this project, I was in charge of extending the classes of body organ segmentation and integrating deep learning based segmentation. I deployed vision transformers (ViT) to extend the segmentation capabilities of the software and intergrated MONAI libarary alongside unsupervised learning to enable interactive segmentation.
Employed advanced techniques to accurately detect lesion regions in chest X-rays using unlabeled data.
Conceptualization and implementation of an AI model, integrating parameter back projection via GradCAM methodology, to enhance diagnostic accuracy and efficiency.
Research and development of algorithms for digital twins of human using medical images. (Registed a patnet on this)
Developed a sophisticated deformable/learnable head template that can fit on any 2D image to create 3D image of it.
Photo-realistic rendering of face and whole-body including rendering on NVIDIA omniverse
Ideated and developed a 2D texture mapping deep learning model that maps image texture on a 3D deformable object.
Research, development, and deployment of advanced AI applications for medical systems
Developing 3D medical image registration algorithms using deep learning
Initiating the Grand Segmentation Project for medical image segmentation (Pthon and C++)
Applying computer vision algorithm in real-time system software using (C/C++)
Plan, design, research and development of a comprehensive gaze estimation system software and android app. (Kotlin, Python and C++).
The gaze app achieves an average error of less than 3cm of L2 (Euclidean) distance on android devices - with a screen size of 10.1", and PCs - with a screen size of 24".
[Report]   [Demo]   [Code] Built, designed and delivered a sophisticated smart scale android app using ML algorithms, point cloud, OpenNI and OpenGL libraries.
The app can estimate and visualize pig's weight from a 3D scanned image. (Java)
The app achieves a 97% of accuracy in real world deployment in less than 7 seconds.
[Report]   [Demo]   [Code]Published in IEEE, CSCI-2020 international conference. In this paper I propose a novel data augmentation strategy that expands the size of the dataset to improve the performance of deep learning networks for image semantic segmentation.
[Paper]   [Demo]   [Code]Sediqi KM, Lee HJ. A Novel Upsampling and Context Convolution for Image Semantic Segmentation. Sensors. 2021; 21(6):2170. https://doi.org/10.3390/s21062170
Sediqi KM, Lee HJ. Improved Image Semantic Segmentation Based on Cascade Data Augmentation. In 2020 International Conference on Computational Science and Computational Intelligence (CSCI) 2020 Dec 16 (pp. 1607-1612). IEEE.
Khwaja Monib Sediqi,and Hyo Jong Lee. "Importance of context in semantic segmentation of images." 한국정보과학회 학술발표논문집 2020.7 (2020): 871-873.
Sediqi KM, Sung TL, Lee HJ. "Extended Temporal Convolutions for Human Action Recognition in Videos." International Symposium on Information Technology Convergence (2019)
Sediqi KM, Lee HJ. "Decomposed "Spatial and Temporal" Convolution for Human Action Recognition in Videos."한국정보처리학회 2019년도 춘계학술발표대회 (2020)